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What is claimed is: 

1 . A computer implemented method for calculating a normalization factor 
comprising: 

providing a first intensity value (Z^) of a probe in a first probe array and a 
second intensity value (f 2) ) of said probe in a second probe array; 

obtaining the geometric mean (x) of said 

calculating said normalization factor according to: 

f(x) = e h{x) , wherein said h(x) is derived from referential intensities from 
said first and second probe arrays. 



The method of Claim 1 wherein said h(x) is derived by relating geometric means 
(Xi') of first referential intensities (7?//^) in the first probe array and second 
referential intensities {Rlj 2) ) in the second probe array to: 



( m& ^ 



y l =log 



EL 

Rl 
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3. The method of Claim 2 wherein said relating comprising: 

sorting (jc/, yd pairs according to x t into a plurality (m number) of bins with no 
overlapping; 

computing medians ( x k ) of xfs and medians (y k )of yt 's for each bin; and 
interpolating said medians (x k , y k ). 
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The method of Claim 3 wherein said bins are of approximately equal size. 



The method of Claim 4 wherein said h(x) is: 



h(x) = 



if x<x\ 



wy x + (1- w)y 1+p if xe (x u xi + 1), w = 



Jtj + 1 - x 



,i = l,...,m-l, 



+ 1 - j: j 



The method of Claim 5 wherein said m is 3. 



A computer implemented method for comparing the expression of a gene in a first 
sample with a second sample comprising: 

providing a first plurality of intensity values ( I® ), each of which reflects 
the expression of said gene in said first sample, wherein said intensity values are 
obtained from a first nucleic acid probe array; 

providing a second plurality of intensity values ( J, (2) ), each of which 
reflects the expression of said gene in said second sample, wherein said intensity 
values are obtained from a second nucleic acid probe array; 

calculating a p-value using one-sided Wilcoxon's signed rank test, wherein 
the rvalue is for a null hypothesis that median(f(x) l[ 2) - 1™ )=0 and an alternative 
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hypothesis that median((f(x) 7, (1) - 1\ 2) )>0, wherein said/fr) is a normalization 
factor; and 

indicating whether said transcript is present based upon said p-value. 

8. The method of Claim 7 further comprising a step of calculating normalization 
factor, said step comprising: 

obtaining the geometric mean (x) of said 7, (1) and said 7, (2) ; 
calculating said normalization factor according to: 
f(x) = e h(x) , wherein said h(x) is derived from referential intensities from 
said first and second probe arrays. 

9. The method of Claim 8 wherein said h(x) is derived by relating geometric means 
(XiO of first referential intensities in said first probe array and said second 
referential intensities (Rl! 2) ) in said second probe array to: 



10. The method of Claim 9 wherein said relating comprising: 

sorting {x t , yd pairs according to x t into a plurality (m number) of bins with no 
overlapping; 

computing medians (x k )of x ( 's and medians ( y k ) of y/s for each bin; and 
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interpolating said medians (x k , y k ). 



The method of Claim 10 wherein said bins are of approximately equal size. 



The method of Claim 1 1 wherein said h(x) is: 



h(x) = 



_ if x < xi 

wy i + (l-w)y i+v ifxe(x iy Xt +l],w = = =- ,i = l,..-,w-l, 

ifx>X m . 



x i + 1 + 



The method of Claim 12 wherein said m is 3. 



A system for calculating a normalization factor comprising: 
a processor; and 

a memory coupled with the processor, the memory storing a plurality of 
machine instructions that cause the processor to perform a plurality of logical 
steps when implemented by the processor, the logical steps comprising: 

providing a first intensity value (f) of a probe in a first probe array and a 
second intensity value (f) of said probe in a second probe array; 

obtaining the geometric mean (x) of said f } and said P } ; 

calculating said normalization factor according to: 

f(x) = e h(x) , wherein said h(x) is derived from referential intensities from 
said first and second probe arrays. 
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15. The system of Claim 14 wherein said h(x) is derived by relating geometric means 
(pa') of first referential intensities (RIi (1> ) in the first probe array and second 
referential intensities (Rip) in the second probe array to: 



16. The system of Claim 15 wherein said relating comprising: 

sorting (pa, yd pairs according to x t into a plurality (m number) of bins with no 
overlapping; 

computing medians (x k ) of xfs and medians ( y k ) of y t 's for each bin; and 
interpolating said medians (x k ,y k ). 

17. The system of Claim 16 wherein said bins are of approximately equal size. 

18. The system of Claim 17 wherein said h(x) is: 



y, =iog 




if x < xi 



h(x) = < 



wy x + (1 - w)y : 



l+p 



if XE (Xi,Xi+i),W = = 



— ,i = l,...,m-l, 



tf X ^ Xm • 



19. The system of Claim 18 wherein said m is 3. 
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20. A system for comparing the expression of a gene in a first sample with a second 
sample comprising: 

a processor; and 

5 a memory coupled with the processor, the memory storing a plurality of 

machine instructions that cause the processor to perform a plurality of logical 
steps when implemented by the processor, the logical steps comprising: 

providing a first plurality of intensity values each of which reflects 

the expression of said gene in said first sample, wherein said intensity values are 
10 obtained from a first nucleic acid probe array; 

providing a second plurality of intensity values ( 7 ( (2) ), each of which 
reflects the expression of said gene in said second sample, wherein said intensity 
values are obtained from a second nucleic acid probe array; 

calculating a p-value using one-sided Wilcoxon's signed rank test, wherein 
15 the p-value is for a null hypothesis that median(f(x) 7, (2) - J™ )=0 and an alternative 

hypothesis that median((j(x) 7, (1) - 7< 2) )>0, wherein said fix) is a normalization 
factor; and 

indicating whether said transcript is present based upon saidp-value. 

20 21. The system of Claim 20 further comprising a step of calculating normalization 
factor, said step comprising: 
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obtaining the geometric mean (x) of said and said lj 2) ; 
calculating said normalization factor according to: 
f(x) = e h(x) , wherein said h(x) is derived from referential intensities from 
said first and second probe arrays. 

22. The system of Claim 21 wherein said h(x) is derived by relating geometric means 
Or, ') of first referential intensities (Rll 1 ) in said first probe array and said second 
referential intensities (Rlf 2 ) in said second probe array to: 



y, =iog 



p/(2) 

y RI ' j 



23 . The system of Claim 22 wherein said relating comprising: 

sorting (xi, y,) pairs according to into a plurality (m number) of bins with no 
overlapping; 

computing medians (x k ) of x t 's and medians ( y k ) of y t 's for each bin; and 
interpolating said medians (x k ,y k ). 



24. The system of Claim 23 wherein said bins are of approximately equal size. 



25. The system of Claim 24 wherein said h(x) is: 
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_ if x < xi 
h(x) = \wy t +(l- w)y l+1 , if xe (x l9 xt + 1], w = - — , i = l,...,m - 1, 

ifx>Xm. 



The system of Claim 25 wherein said m is 3. 



A computer software product for calculating a normalization factor comprising: 

computer program code for providing a first intensity value of a probe 
in a first probe array and a second intensity value (P } ) of said probe in a second 
probe array; 

computer program code for obtaining the geometric mean (x) of said t l) 
and said t 2) ; 

computer program code for calculating said normalization factor according 

to: 

f(x) = e Hx) , wherein said h(x) is derived from referential intensities from 

said first and second probe arrays; and 

a computer readable medium for storing said codes. 

The computer software product of Claim 27 wherein said h(x) is derived by 
relating geometric means (x t ') of first referential intensities (RIi (1) ) in the first 
probe array and second referential intensities {Rlj 2} ) in the second probe array to: 
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p/(2) 

K * J 



29. The computer software product of Claim 28 wherein said code for relating 
comprising: 

computer program code for sorting {x u yd pairs according to x t into a plurality (m 
number) of bins with no overlapping; 

computer program code for computing medians ( x k ) of Xi's and medians ( y k ) of 

yts for each bin; and 

computer program code for interpolating said medians (x k , y k ). 



30. The computer software product of Claim 29 wherein said bins are of 
approximately equal size. 



31. The computer software product of Claim 30 wherein said h(x) is: 



h(x) = 



_ if x< x\ 

y n 

wy x + (1 - w) y 1+1 , if x € (xi , xi + 1) , w = 

if X ^ Xm * 



+ 1 - X j 



,i' = l,...,m-l, 



32. The computer software product of Claim 31 wherein said m is 3. 
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A computer software product for comparing the expression of a gene in a first 
sample with a second sample comprising: 

computer program code for providing a first plurality of intensity values 
( ), each of which reflects the expression of said gene in said first sample, 
wherein said intensity values are obtained from a first nucleic acid probe array; 

computer program code for providing a second plurality of intensity values 
(7 ( (2) ), each of which reflects the expression of said gene in said second sample, 
wherein said intensity values are obtained from a second nucleic acid probe array; 

computer program code for calculating a p-value using one-sided 
Wilcoxon's signed rank test, wherein the p-value is for a null hypothesis that 
median(j(x)lj 2) - )=0 and an alternative hypothesis that median((f(x) if 0 - 
/ ( (2) )>0, wherein said/fx) is a normalization factor; 

computer program code for indicating whether said transcript is present 
based upon said p-value; and 

a computer readable medium for storing said codes. 

The computer program code of Claim 33 further comprising computer program 
code for calculating normalization factor, said code comprising: 

code for obtaining the geometric mean (x) of said and said 7< 2) ; 

code for calculating said normalization factor according to: 
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f(x) = e h(x) , wherein said h(x) is derived from referential intensities from 
said first and second probe arrays. 

35. The computer software product of Claim 34 wherein said h(x) is derived by 
5 relating geometric means (jc,') of first referential intensities (RI, (1> ) in said first 

probe array and said second referential intensities (Rlj 2} ) in said second probe 
array to: 



y, =log 



c ml} 



10 36. The computer software product of Claim 35 wherein said code for relating 
comprising: 

computer code for sorting (x if yd pairs according to x t into a plurality (m number) 
of bins with no overlapping; 

computer code for computing medians (x k ) of x t 's and medians ( y k ) of y t f s for 

15 each bin; and 

computer code for interpolating said medians ( x k , y k ). 



37. The computer software product of Claim 36 wherein said bins are of 
approximately equal size. 

20 
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38. The computer software product of Claim 37 wherein said h(x) is: 



h(x) = 



_ if x < x\ 

y\ > _ _ x i+\ - x 

wy,+(l-iv)y i+1 , if xe(x i9 Xi+l] 9 w = ^ =- ,i = l,...,wi-l, 

Z^ 7 "' ifx>X m . 



x l + l +-x t 



39. The computer software product of Claim 38 wherein said m is 3. 
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